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1.  Introduction 

Consider  two  binomial  populations  and  ^ having  success  proba- 

bilities 0 < p^  < 1 and  0 < p2  < 1 respectively.  Experimenters  have 
used  a variety  of  measures  for  comparing  1T1  and  *2  including  the  odds 
ratio  t = p1(l-p2)/(l-p1)p2>  the  relative  risk  p = P-^/Pj  and  the  dif- 
ference between  the  success  probabilities  A i p1  - P2*  See  Cornfield  (1956), 

Gart  (1971),  Gail  (1973),  Dunnett  and  Gent  (1977),  and  Katz  et.  al.  (1977) 
for  comparisons  of  these  measures  and  examples. 

When  p^  and  p2  are  unknown  the  statistician  is  interested  in  con- 
structing confidence  intervals  for  the  measure  of  interest  based  on  a 2 * 2 
contingency  table  of  data  formed  from  independent  random  samples  of  sizes 
and  N2  from  and  n2,  respectively. 

For  the  measure  both  exact  (small  sample)  confidence  intervals 
(Cornfield  (1956)  and  Katz  et  al.  (1977))  and  asymptotic  (large  sample) 
confidence  intervals  (Cornfield  (1956)  and  Gart  (1971)  among  others)  have 
been  devised.  Computer  programs  for  implementing  the  exact  methods  of 
Cornfield  and  Katz  et.  al.  have  been  published  by  Thomas  (1971)  and  Baptista 
and  Pike  (1977)  respectively. 

Asymptotic  confidence  intervals  for  p have  been  proposed  by  a number 
of  authors.  Katz  et.  al.  (1977)  contains  a summary  and  comparison  of  five 
such  intervals.  There  have  also  been  a number  of  asymptotic  confidence 
intervals  proposed  for  A (Gart  (1971)).  Buhrman  (1977)  proposed  methods 
for  designing  and  analyzing  an  experiment  which  yields  exact  confidence  in- 
tervals for  p and  A.  Thomas  and  Gart  (1977)  have  published  a method  for 
constructing  "exact"  confidence  intervals  for  p and  A in  undesigned  exper- 
imenta.  However,  as  we  show  below,  their  A intervals  do  not  satisfy  the 
conditional  confidence  guarantee  they  claim  nor  even  a corresponding 
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unconditional  confidence  guarantee.  Katz  et.  al.  (1977)  give  conditional 
counterexamples  for  their  p intervals. 

This  paper  proposes  two  methods  for  constructing  exact  100(l-a)% 
confidence  intervals  for  A in  undesigned  experiments.  It  will  be  in- 
dicated how  the  methods  can  be  modified  to  determine  exact  confidence  in- 
tervals for  p.  We  begin  by  reviewing  that  part  of  the  basic  theory  of 
2 x 2 tables  required  for  our  later  work;  we  then  give  two  examples  which 
illustrate  the  problem  of  the  intervals  in  Thomas  and  Gart  (1977). 

Let  X^  and  be  the  numbers  of  successes  based  on  independent 

random  samples  of  sizes  and  from  it^  and  respectively  and 

let  X = (X^jX^).  The  joint  probability  mass  function  of  X^  and  X^  is 
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Also  of  interest  is  the  conditional  distribution  of  given  x^  + X^  = m 

which  is  given  by 


(1.2)  g(  j 


£(m)  ,*,(m)+l, . . . ,u(m) 


where  *.(m)  = max{0,  m-Nj}  and  u(m)  5 minCm.N,^.  Expression  (1.2)  is 

valid  for  any  integer  m between  0 and  inclusive;  g(*jm,ij») 

is  degenerate  at  x equal  to  0 and  Nn  + N„  when  m = 0 and  N + N 

x 12  12 

respectively.  In  other  cases  g( * |m,^)  depends  on  p1  and  p?  only 
through  the  odds  ratio  ip. 

Both  Cornfield  (1956)  and  Katz  et.  al.  (1977)  have  proposed  methods 
for  constructing  exact  two-sided  100(l-a)%  confidence  intervals 

*U<«))  f°r  If  ^ denotes  a probability  calculated 

under  distribution  (1.2)  then  both  of  their  intervals  satisfy 

(1.3)  < * < I1'01 

for  every  ip  e (0,«)  and  every  integer  0 <_  m <_  ^ + N2-  Hence  they  satisfy 
the  (unconditional)  confidence  interval  guarantee 

a.*)  ppi,4t*L<>‘>  < * < *„<«3  i i-. 


for  all  p^  and  A.  The  method  of  Katz  et.  al.  will  be  reviewed  since 
it  yields  shorter  intervals  than  the  Cornfield  method  and  the  tables  of 
Section  2 are  based  on  their  intervals. 
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Fix  m € {0,1,  — ,N.  + N„)  and  for  each  i|>  e (0,°°)  let  A = A (m) 

1 2.  ijj 

be  the  subset  of  {£(m) , . . . ,u(m)}  so  that  P^[X^  e A^|m]  >_  1-a  and 
g(k|m,i|>)  ^g(j|m,iM  for  all  k ^ A^  and  all  j e A^.  Define  the 
confidence  interval  for  i p to  be  {\l>  e (O,00)^^  e A^CX^+X^)} . It 
is  easy  to  check  that  the  resulting  interval  satisfies  (1.3). 

In  a series  of  papers  Gart  (1971),  McDonald  et.  al.  (1974)  and 
Thomas  and  Gart  (1971)  attempt  to  produce  exact  small  sample  confidence 
intervals  for  A based  on  intervals  ^(JjC))  satisfying  (1.3) 

and  on  the  following  relationships.  To  construct  the  upper  limit 
A^  = Ay($)  first  determine  the  solution  Xy  of  the  equation 


(1.5) 


Xy(Xy+N2-m) 

” (m-Xy )(N^-Xy) 


satisfying  max{0,  m-N2>  < Xy  < mintm,^}.  Then  let  Ay  = Xy/J^  - (m-Xy)/^. 

The  lower  limit  A can  be  obtained  in  a similar  fashion.  These  authors 
L 

claim  that 


(1.6) 


< A < V$>M  -1_a 


for  all  1 1>  e (O,00)  and  hence  (A^(JJ),  Ay(X))  also  satisfies  the  probability 
guarantee  unconditionally.  This  method  has  two  problems.  The  first  is  that 
Equation  1.5  defining  Xy  is  derived  on  asymptotic  grounds  (Cornfield  (1956)). 
The  second  is  that  if  m f or  m f then  A^  or  Ay  are  bounded 
away  from  -1  or  1.  This  second  problem  will  be  examined  in  more  detail 
in  Section  4.  Consequenty  it  is  not  surprising  that  (1.6)  need  not  hold  for 
small  samples  as  the  following  example  shows. 
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Example  1.1:  Let  ^ = 2 = N2»  m = 1 and  a = .01.  McDonald  et.  al.  (1974) 
compute  the  following  99%  confidence  intervals  for  A based  Gart's  method. 


X1 

x„ 

A 

A 

1 

2 

L 

U 

0 

1 

-1 

.4808 

1 

0 

-.4808 

+1 

Consider  pL  = 3/4  and  p2  = 1/4  so  that  A = 1/2  and  <|>  = 9. 

" 1/2  *=  y*>|1]  = V9[xi=1»  x2=°i1]  = -9o < 

As  Example  1.1  is  a conditional  probability  calculation  it  might 
still  be  conjectured  that  this  method  satisfies  the  unconditional  probability 
requirement 

(l-7)  ‘ a * 4„  i i- 

for  all  A e (-1,1)  and  px  e 1(A)  provided  (A^Ay)  is  correctly  defined 

for  m = 0 and  ^ + N2  since  the  corresponding  interval  is  undefined 

for  these  two  outcomes.  In  the  following  example  (A  ,A  ) is  defined  to 

L U 

be  (-1,1)  when  m = 0 or  + N2> 


frggEj-g.  .1-2:  Again  choose  ^ = 2 = N2  and  o = .01.  The  conditional 
confidence  intervals  of  McDonald  et.  al.  (1974)  are 
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X1 

X2 

al 

Au 

0 

2 

-1 

.4811 

0 

1 

-1 

.4808 

1 

2 

-1 

.4808 

1 

1 

-.9316 

.9316 

0 

0 

-1 

+1 

2 

2 

-1 

+1 

2 

1 

-.4808 

+1 

1 

0 

-.4808 

+1 

2 

0 

-.4811 

+1 

When 

P3/4,l/2 
= .949  < 


= 3/4 

CA.(X) 
L 'v 

.99. 


and  p2  = 1/4  then 
< 1/2  < AU(X)]  = 1 - 


A = 1/2,  ip  = 9 and 

P3/4,1/2C(X1’V  e U0,2),(0,l)(l,2)}] 


In  Section  2 a method  of  constructing  exact  confidence  intervals  for 
A based  on  conditional  ip  intervals  will  be  proposed  which  satisfies  the 
conditional  confidence  guarantee  (1.6)  and  hence  the  weaker  unconfidence 
guarantee  (1.7).  Section  3 discusses  a second  method  for  constructing  A 
intervals  which  directly  attempts  to  satisfy  (1.7)  rather  than  the  conditional 
statement  (1.6).  Section  4 gives  an  example  and  draws  some  comparisons 


between  the  two  methods  while  Section  5 summarizes  the  results  and  makes 
some  recommendations  regarding  the  use  of  Thomas-Gart  (1977)  intervals. 
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2.  Conditional  Confidence  Intervals 

For  a given  4*  e (0,°°)  there  are  infinitely  many  (p^,P2>  pairs 
or  equivalently  (p1»A)  pairs  associated  with  that  4>  value.  For  each 
4>  e ( 0,°°)  let 

(2.1)  D(40  = (A  £ (-l,l)|3p1  e 1(A)  3 p^l+A-p^/a-p  )(p  -A)  = 4/) 

be  the  set  of  differences  associated  with  the  odds  ratio  4>. 

The  idea  of  the  method  is  to  use  the  set  of  all  differences.  A'  , 
associated  with  ip's  in  an  interval  (4>L($),  4^$))  satisfying  the  condi- 
tional confidence  guarantee  (1.3)  as  the  confidence  interval  for  A.  It 
will  be  shown  below  (Theorem  2.1)  that  the  resulting  confidence  interval 
will  also  have  conditional  confidence  level  (1-a)  and  hence  unconditional 
confidence  level  (1-a). 

Formally  the  intervals  are  defined  as  follows.  For  any  (a,b)  c (0,*) 
let 


(2.2)  E(a,b)  = u D(40. 

(a,b) 

E(a,b)  is  the  set  of  all  A’s  representable  as  ~ P2  ^or  soine  (p^p^ 
satisfying  p^(l-p2)/(l-p1)p2  £ (a,b).  E(a,b)  is  always  non-empty  since 
the  equation  y = p^(l-p2)/(l-p1)p2  always  has  solutions  satisfying 
0 < P^»  P2  < 1 for  any  0 < y < ® . The  interval  boundaries  A^  and 
are  defined  in  terms  of  (4^*4^)  and  £(•,•)  as  follows: 


(2.3) 


Al(X)  = inf  E(4'l»4'u)  and 
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(2.4)  A^X)  = sup  EC^.iJ^). 

The  following  characterization  of  E(a,b)  will  simplify  the  calculation  of 
the  sup  and  inf  given  in  (2.3)  and  (2.4). 

Lemma  2.1.  E(a,b)  = (A  = p.^  - (l-p^)^  + P1)|(p1>^)  e (0,1)  x (a,b)}. 

Proof . If  A e E(a,b)  then  there  exists  £ 1(A)  c (0,1)  so  that 
P1(l+A-p1)/(l-p1)(p1-A)  = ip  for  some  ip  in  (a,b).  Solving  for  A shows 
A is  in  the  right  hand  set  above.  Now  suppose  A is  in  the  right  hand 
set  then  there  is  (p^d)  e (0,1)  * (a, b)  satisfying  A = p^  - p^/td-p^i/*  + p^). 
It  follows  that  A £ (-1,1)  since  0 < p^  < 1 and  ip  > 0 . Solving  for 
p gives  ip  = p^(l+A-p^)/(l-p^)(p  -A);  it  remains  to  show  p^  e 1(A)  in 
order  that  A £ Dd)  c E(a,b).  Three  cases  are  possible:  (1)  -1  < A < 0, 

(2)  A = o and  (3)  0 < A < l.  Only  the  first  case  will  be  considered 
as  the  remaining  two  are  similar.  It  suffices  to  show  p^  < 1+A  since 
1(A)  = (0,  1+A)  when  -1  < A < 0.  But  0 < ip  = p (1+A-p  )/(l-p  )(p  -A)  < 00 
implies  0 < 1-p^+A  since  mintp^,  l-p^jp^-A}  > 0 when  -1  < A < 0 and 
the  proof  is  completed. 

Let  A(p1»i<<)  = pL  - p1/((l-p1)ij;  + p^  and  R(a,b)  = (0,1)  x (a, b).  From 
(2.3),  (2.4)  and  Lemma  2.1  the  100(l-a)%  conditional  confidence  limits 
Ar(X)  and  A.,(X)  corresponding  (ipT  ,ip.,)  are  the  solutions  of  the  following 

Li  'V/  U 'Xj  Li  U 

optimization  problems: 

Ar  (X)  = inf  A(p.d)  and 

R<V*U> 

A (X)  = sup  A(p  d). 

*<VV 


- - 
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First  a lemma  describing  the  behavior  of  A(p1»i|')  as  a function  p^ 
for  fixed  will  be  given. 


Lemma  2.2. 


(a)  For  any  fixed  0 < <P  < 1,  A(p^,ip)  is  a (strictly)  negative  convex 

further 

/a  - 1 


function  in  p ; furthermore  sup  A(p  ,ifi)  = 0 and 

Pie(0,l) 


= A 


_/L. 

^ + i 


inf  A(p  ,if/) 
P1e(0,l) 


•» 


/p  + 1 


(b)  For  ^ = 1,  A(p^,l)  = o for  all  p^  e (0,1). 

(c)  For  any  fixed  1 < ip  < <*>,  A ( p^ , ) is  a (strictly)  positive  concav 

sup  A(p  ,i|/)  = A 


yip 


function  in  p ; furthermore  __r 

pie(0,l)  1 \/ip  + 1 

and  inf  A(p  ,i^)  = 0. 


, *)  = 

J /ir  + 1 


P^O.l) 

Proof . Case  (b)  is  immediate  from  the  definition  of  A(p^,i|i).  it  suffices 

to  prove  (a)  since  (c)  follows  from  (a)  and  the  easily  verifiable  relationship 

A(p^,i/<)  = -A(l-p^,  1/ip).  Fix  0 < 4>  < 1;  for  any  0 < p^  < 1 we  have 

Pl 

(l-p^i*/  + px  < 1 and  hence  A(Pl,t|>)  = PL  - ~1  y,,,  * n < P-,  - P,  = 0. 


(l-p1)i</  + P1  F1  *1 


Taking  derivatives  of  A(p^,^)  wrt  p^  gives 


(2.5) 


3A 

3Pt 


= 1 - 


and 


((1-P1)^  +P1)' 


(2.6) 


3 A _ 2i|)(l-i|>) 

9pi  Cp1d-<i')+'p)3 


32A 

Now  — =•  > 0 for  0 < p.  < 1 since  0 < < 1 and  hence  A(p1  ,ty)  is 

3Pi 


iirhr1,i,aMg-*i 
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convex  in  p^.  The  minimum  of  A(p^,^)  occurs  at  the  solution  of 
3 A . 

3pT  ~ 0 i-e.,  at  P;L  = — while  lim  A(p  ,*)  = lim  A(p, ,*)  = 0 

1 •'*  + 1 Pj^v  1 p 1 

and  hence  the  supremum  of  A over  (0,1)  is  0.  This  completes  the  proof. 

Figure  A is  a graph  of  A(P;L,ip)  vs.  P;l  and  i p.  The  main  result 
of  trhe  section  will  now  be  given. 


Theorem  2.1.  Suppose  (^(jjC),  ^(jg))  c (0,«)  satisfies 

^ < - 1_ot  V"1  < A < 1 and  Pi  e 1(a)  for  some  integer 

m between  zero  and  ^ The  interval  (A^g),  Ay(x))  given  by 


r 


(2.7) 


/V^)  - i 


v*>  = < 


(2.8)  •Au(^)  = ( 


/V*P  + 1 

0, 

0, 

- 1 
+ 1 


L + 1, 


0 < (X)  < 1 

— L 'v 


1 W 


0 < yx)  < 1 


I h — oo 


satisfies  PpijACAL(£)  < A < A^Xm]  > 1-c  V -1  < A < 1 and 
Px  e 1(A). 


P££of*  First  note  that  f(x)  = (/x  -l)/(i/x+l)  can  be  easily  shown  to 

be  strictly  increasing  on  the  domain  (0,«).  Now  fix  A e (-1,1)  and 

P2  t 1(a)  and  let  \p  be  the  corresponding  odds  ratio.  Suppose  the  sample 

point  u t LAL(X)  < ip  < ip  (X)]}  it  suffices  to  prove  me  [A.  (X)  < A < A (X)]. 

L U ^ 


The  details  of  the  proof  differ  slightly  according  as  (a)  ii^(J|C)  < 

(b)  1 £ and  (c)  ^L()<)  41  1 < '/'y(^)  and  only  (a)  will  be  discussed  as 

the  other  cases  are  similar.  Since  <|<  < ^(JjC)  ± 1 we  have  Mp^)  < 0 = Ay(^) 

by  Lemma  2.2  and  the  definition  of  A . Choose  ip ' « ( if/  ,i|/)  then 

U Li  Li 

/t  _ i - 1 

Afp.sil')  ^ > > A (X)  where  the  first  inequality  follows 

r*''  /Z*  i 

L 

from  Lemma  2.2,  the  second  from  ^ > <Ji'  and  the  strictly  increasing  nature 
of  f(x)  and  the  last  by  the  definition  of  A^(X).  The  proof  is  completed. 

This  method  can  also  be  applied  to  determine  exact  conditional  confidence 
intervals  for  the  relative  risk  p = p^/p^.  F°r  P (0,°°)  and 
(a,b)  c (0,°°)  define  the  analogs  of  (2.1)  and  (2.2)  by 


(2.9)  I'(p)  = 


(0,1),  p > 1 


( 0,p ) , 0 < p < 1 


(2.10)  d'(4>)  = {p  € (O.^jap^  e l'(p)  5 (p-p1)/(l-p1)  = I p}  and 


(2.11)  E’(a,b)  = u D'  (<p) , 

\l>e(a,b) 


D (t|/)  is  the  set  of  p = p^/p^  values  associated  with  the  odds  ratio  \|j.  The 
function  p:  (0,1)  x(0,“>)  (0,<»)  relating  p,p^  and  is  given  by 

P(P1»'/')  = 4'(1-P1)  + p^.  An  analysis  of  this  surface  similar  to  that  given 
by  Theorem  2.1  yields  the  following  confidence  limits  for  p. 

Theorem  2.2.  Suppose  (^(^J,  'Py(^))  satisfies  the  hypothesis  of  Theorem  2.1. 
The  interval  (pL(Jp,  p^X))  given  by 


I 


u 


13 


(2.12)  PL(«)  = 


W*  0 1 W - 1 


i.  i < M*) 

Li  /\j 


(2.13)  P0(X)  = l 


1,  0 < ill  (X)  < 1 

u 


1> „(X),  1 < *T(X)  < » 

U 'X/  Li  r\,  — 


satisfies  Pp  >AtPL(X)  < o < PlJ(X)|m]  > l-o  for  all  p > 0 and 
P1  e I ’ (p ) . 

The  conditional  method  of  this  section  is  computationally  simple  to 
implement.  The  intervals  are  generally  wider  than  the  corresponding  Gart 
intervals.  Table  1 contains  100(l-a)%  two-sided  confidence  intervals 
for  1 N^,  <^10  and  a = .01,  .05  and  .10.  The  computations  were 

made  on  Cornell  University's  IBM  370/168  computer  by  applying  (2.7)  and 
(2.8)  to  the  ip  intervals  of  Baptista  and  Pike  (1977).  Intervals  are  not 
listed  for  all  possible  (x^.Xj)  pairs.  When  (xi>x2)  = (0,0)  or  (N^,N2) 
the  A interval  is  (-1,+1)  while  for  any  other  (x  ,x2>  not  listed  in 
Table  I the  A interval  can  be  obtained  from  the  relationships! 

Al/Xl’X2)  = “VNl"Xl’  N2-x2)  and  AU<X1’X2)  = ~AL(Nl'Xl’  N2*x2)* 

In  contrast  to  the  conditional  method  of  the  present  section  the  direct 
method  of  Section  3 is  computationally  difficult  but  generally  produces 
tighter  bounds  than  those  in  Table  I. 


14 


3.  Unconditional  Confidence  Intervals 


3.1  Introduction 


This  section  describes  a method  for  constructing  confidence  intervals 
which  directly  satisfy  (1.7).  The  following  notation  and  definitions 
will  be  required  throughout.  For  any  R c and  A e (-1,1) 

let  $(R,A ) denote  inftP  .[(X  ,X  ) e Rl] | p € 1(A)}  and  4>(R,A) 

J-  * 1 

denote  sup{P  .[(X  ,X  ) e R ] | p e 1(A)}. 

a 12  1 

Definition  3.1.  A set  U c x is  in  the  northwest  corner  (NWC) 
of  N.^  x N2  provided  (a)  (x1>x2)  c U =>  (x^,*.)  e U for  «.  = x2+l,...,N2 
and  (b)  (x^x.^)  e (J=>U,x2)  e U for  l = 0,...,x1~l. 

Conditions  (a)  and  (b)  are  equivalent  to  requiring  that  the  quadrant  of  points 

in  x N2  to  the  "northwest"  of  any  (x^,x2)  in  U also  be  in  U. 

Suppose  P = { (d^.d^] ,(d^,d2], . . . ,[dn_1,d  ) } is  a partition  of  (-1,1) 

satisfying  -1  = d.  < d < . . . < d =1  and  S = {R, ,...,R  } is  a 
u i n In 

collection  of  subsets  of  x N2.  Here  the  events  need  not  be 

disjoint.  For  a fixed  pair  (P,S)  define  for  each  (x^,x2)  e x ^ 


(3.1)  T(x.,x_)  = u (d.  ,d . ] u 

j:d.<0  3-J-  3 j :dj_1^0 , 


(d.  . ,d. ) u [d.  ,d. ). 

3"1  ^ j:d.  >0  M 3 

3-1 


The  first  result  is  that  TfX^jX^  is  a 100(l-a)%  confidence  region 
for  A provided  (P,S)  satisfies 

Condition  3.1.  <KRj,A)  _>  1-a  v A e [d..  ^dj  and  V j e {l,...,n}. 


Lemma  3.1.  Suppose  (P,S)  satisfies  Condition  3.1  then 
P »[A  « T(X  ,X  )]  > 1-a  for  all  A e (-1,1)  and  all  p,  e 

P^»a  1 Z — 1 


1(A). 


J 
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r 


A € 


i -1*  i J 
0 0 


t and  p^  £ 

1(A); 

let  iQ  : 

(di  -l’di  ■*’ 
o *0 

if 

d.  < 0 

(di  -1  di  >’ 

o xo 

if 

d.  . < 0 

v1- 

Cd.  ,,d.  ), 
0 x0 

if 

d.  , > 0 
10-1 

and 

VU  ,T<W3*  'Pl>1c<W 


$(R,  ,A)  since  p e 1(A) 
10  1 


> 1-a  and  the  proof  is  completed. 


Clearly  there  are  many  (P,S)  pairs  satisfying  Condition  3.1,  for 
example,  the  trivial  pair  P = {(-1,1)}  and  S = x > ; by  Lemma  3.1 
any  of  these  can  be  used  to  generate  100(l-a)%  confidence  regions  for  A. 

Two  additional  intuitive  requirements  will  be  imposed  on  (P,S). 

Condition  3.2.  T(x^,x2)  must  be  an  interval  for  all  (x^x^)  e x N^. 

This  is  equivalent  to  requiring  that  for  each  point  (x^.Xj)  e x N2 
there  are  indices  f = fU^x^  and  l = Ux^x^  satisfying  (x^x  ) c R 
<-»  f < i < l. 

The  last  requirement  deals  with  the  shape  of  R^.  First  some  addi- 
tional notation  must  be  introduced.  Let  = IT^R)  = {i 1 3 j with  (i,j)  e R} 
and  n2  = n2(R)  = {j|a  i with  (i,j)  « R}  be  the  projections  of  R c ^ x 
onto  the  x^  and  x2  axes  respectively. 
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Condition  3.3.  For  Rj,  (1)  must  be  of  the  form  Uj  - U ^ where 

c U'  and  both  are  NWC  sets,  (2)  n ( R ) and  n (R  ) must  be 

intervals  of  integers  and  (3)  there  must  exist  real  numbers  a and  6 > 0 

so  that  a + Bi  >_  s^  s mini  j | (i,  j ) t R^}  whenever  s^  > 0,  a + Bi  ^ 8. ^ 

max{j|(i,j)  e R^}  whenever  (j-a)/B  = min{i|(i,j)  e R^> 

whenever  S.  > 0 and  (j-a)/B  < L.  = max{i|(i,j)  e R } whenever 
3 3 £ 

L.  < N, . Note  that  s.,  I . , S.  and  L.  are  only  defined  for 
3 1 113  1 * 

i e and  3 ^ H^R^). 

Intuitively  (1)  and  (2)  are  connectedness  conditions  which  insure 
that  has  no  holes  while  (3)  is  a technical  condition  under  which 

$(Rj,A)  is  quasi-concave  on  (-1,0]  i.e.  <?>( R^ ) >_  minUfR^.A^.^R^Ag)} 
for  -1  < A < A < A < 0. 

J.  Z.  O 

There  are  numerous  criteria  that  can  be  employed  to  select  from  among 
those  (P,S)  pairs  satisfying  Conditions  (3.1)-(3.3).  Two  examples  are 
(a)  (P,S)  must  minimize  the  average  length  of  the  intervals  T(x^,x2> 

and  (b),  a generalization  of  (a),  (P,S)  must  minimize  a weighted  sum 

of  the  lengths  of  T(x^,x^).  Both  (a)  and  (b)  are  difficult  to  implement 
since  they  only  indirectly  stipulate  conditions  on  (P,S).  This  paper 
contains  an  algorithm  for  generating  (P,S)  based  on  the  so  called 
"greedy"  heuristic.  It  attempts  to  construct  short  intervals  by  forcing 
the  R^  to  be  "small";  as  few  points  as  possible  are  added  to  R^  and 
as  many  points  as  possible  are  removed  for  R.  in  order  to  construct  R.  + ^. 

3.2  The  Algorithm 

Given  a c (0,1)  and  positive  integers  N^  and  the  algorithm 

below  generates  a (P,S)  pair  satisfying  Conditions  (3.1)-(3.3).  Briefly, 


f 


Step  0 is  an  initialization  procedure.  Steps  1 through  5 form  one  inductive 
Etep  and  Step  6 generates  the  final  (P,S)  pair  based  on  symmetry  con- 
siderations. At  each  iteration  Step  1 is  entered  with  constants 
~1  = d0  < di  < •••  < < 0 and  regions  satisfying 

(3.2)  d>(R_. , A)  _>  1-a  V A e (d^  ^,d  ] for  1 < j < i-1 

(3.3)  4> ( R j > d_.  ) = 1-a  for  1 <_  j c i-i 

O-1*)  satisfy  Condition  3.3 

(3.5)  if  I = I(i,x  ,xj  = (j  e {l, . . . ,i)  | (x  ,x  ) e R.)  then  u (d.  ,d  ] 

1 2 1 jcI  i-1*  1 

is  either  empty  or  an  interval  for  every  (x^jX^)  e x N^. 


Two  final  pieces  of  notation  are  required.  Given  R . = (j|  - U.  as  in  (3.4) 
let  x N2  - U . be  the  set  of  points  to  the  "southeast"  of  R.; 

for  arbitrary  S c ^ * N2  let  Sp  = {(x^XjJKn^.H^)  e S)  be  the 

"rotation"  of  S and  |s|  the  cardinality  of  S. 

Ste£_0.  Set  dp  = -1  and  R^  = {(OjN^)}.  Construct  c x ^2  S° 

0,1  Uo  is  in  the  northwest  corner  of  ^ x N2  and  UQ  n Up  = 0 

0.2  ^ x - (UQ  u U^)  satisfies  condition  3.3 

0.3  $(UQ  u Oj, 0)  < a 

0.4  if  B c x N2  satisfies  (0.1),  (0.2)  and  (0.3)  then  either 
lBl  < lu0l  or  |B|  = J UQ | and  »(B  u ^,0)  > *(U0  u (]£,0). 
Set  Lo  = °o»  i = 1 and  go  to  Step  1. 
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Step  1. 


Step  2. 


Set  * (A  e (d^_1,l)|(j)(R^,A)  < 1-a}  and  define  A^  = 1 or 
inf  D1  as  = 0 or  D1  i 0 . 

If  (U Q ~ U / 0 go  to  Step  2. 

If  (U Q - U ^)  = 0 set  = and  if  di  < 0 go  to  Step  4 
while  if  d^  >_  0 go  to  Step  6. 

Set  U = {(x1,x2>  e UQ  n R^|R^  - ((x^x^}  satisfies  Condition  3.3 
and  $(R^  - ((x^.x^J.A)  > 1-a  for  some  A e (d^_^ ,min{ A ,0} ]} . 

If  U i 0 go  to  Step  3. 


If  O’  = 0 set  d^  = A^  and  if  d^  < 0 go  to  Step 


4 while  if 


d.  > 0 go  to  Step  6. 


Step  3. 


Step  4. 


For  each  (x^x^  e O’  set  d^x^Xj)  = inf{A  e [d^  j^.miniA  ,0}]| 

d>(R.  - {(x1,x2)},A)  > 1-a}.  Let  d^^  = mintd^x^  ,x? ) | (x^  ,x2 ) e O’}. 

Construct  U*  c U n R.  so  that 
0 i 

3.1  R.  - U*  satisfies  Condition  3.3. 

3.2  the  infimum  of  {A  e (di_1,min(A1,0})U(R  ~ U*,A)  > 1-a} 

exists  and  equals  d. 

3.3  if  B c Uq  n Ri  satisfies  (3.1)  and  (3.2)  then  | B | <_  |U*|. 

Set  R.+^  = - U*  and  go  to  Step  1. 

Construct  S c L.  - so  that 

l 0 

4.1  S u R^  satisfies  Condition  3.3 

4.2  inf(A  e (d^,l)|$(S  u R^,A)  < 1-a}  > d^  whenever  the  set  is 
nonempty 

4.3  if  B c L,  - Ljj  satisfies  4.1  and  4.2  then  either  |b|  > |s| 
or  | B | = J S | and  <J>(S  u R^dj)  _>  *(B  u Ri,di). 
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Set  R.+1  = S u R.  and  U = ((x^^)  e UQ  n R.+1|R.+1 


{(x1,x2)} 


satisfies  Condition  3.3  and  *(Ri+1  - { (x: ,x2 )} ,d  ) > 1-a}. 

If  U ? 0 go  to  Step  5 while  if  Q = 0 set  R.  = R and 

l+l  l+l 

go  to  Step  1. 

Ste£_5.  Construct  il*  c UQ  n R£+1  so  that  (U*  possibly  empty) 

5-1  ~ satisfies  Condition  3.3 

5.2  inf{A  £ (d^,l)|<J>(R.^^-lffc,A)  < 1-a}  > d^  whenever  this  set 
is  nonempty 

5.3  if  B c UQ  n Ri+1  satisfies  (5.1)  and  (5.2)  then  either 

I B I < |U*|  or  | B | = |U*|  and  <KR.+1-B,d.)  < <KR.+i-(i*,d. ). 


Set  R.  . 
l+l 


R^+1  - U*  and  go  to  Step  1, 


Steg_6.  If  Ri  = R^  then  complete  (P,S)  as  follows:  P = {(-l,d  ],. . . , 

(d£_1»~di_-L)»C-di_1»-di_2^»*  • • »C~d1,l)}  and 
^ = (R^  * * * * jR^  *^i  ^ • »R^}  * 

P 

If  Ri  t Rj  then  complete  (P,S)  as  follows:  P = {(-l,d 

(di_i»03»(°»-di_i]> • • • »C-di»1) ) and  S = {R1,...,Ri,R^,...,R^}. 


3.3  Intuitive  Description 

The  flow  of  the  algorithm  is  diagrammed  in  Figure  B.  It  begins  in  Step  0 

B.  FLOW  CHART  FOR  ALGORITHM 
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by  constructing  two  disjoint  sets  U ^ and  L^  in  the  corners  of  the  region 
N1  * ^2  havin®  to'tal  probability  less  than  a when  A = 0 and  for  any  p^. 

and  Lq  are  used  in  the  completion  process  of  Step  6.  Step  0 also 
initializes  R^  = {(0,N2>}  and  dQ  = -1.  Next  a trial  d^,  A , is 
constructed  in  Step  1 so  that  ij>(R^,A)  >_  1-a  for  all  A e (d^  ^,A^].  Then 
in  Steps  2 and  3 the  algorithm  checks  whether  any  points,  U* , can  be 
deleted  from  R^  if  d^  is  permitted  to  be  smaller  than  A . If  so, 
a revised  d^  is  constructed  and  then  R.+^  is  generated  by  deleting  U* 
from  R^;  Step  1 is  reentered.  If  no  points  can  be  removed  from  R^  and 
d^  < 0 then  the  algorithm  goes  to  Step  4 and  constructs  a trial  by 

adding  points,  S,  to  R^  so  that  4>(R^+1  = R^  u S,d^)  _>  1-a.  Then  in 
the  remainder  of  Step  4 and  in  Step  5 it  checks  whether  any  other  points, 

U*,  can  be  deleted  from  S u R^  while  keeping  <f>(S  u R^  - U*,d^)  1-a. 

At  the  stage  when  d^  >_  0 the  remaining  sets  and  points  d^  are  con- 

structed via  symmetry  considerations  in  Step  6. 


3.4  Properties  of  the  Algorithm 

The  proofs  that  (P,S)  satisfies  Conditions  (3.1)  to  (3.3)  will  be 
given  in  this  subsection  and  the  appendix.  Some  preliminary  results  regarding 
the  behavior  of  4>(R,A)  and  <f>(R,A)  will  be  stated  first.  Let  c£(I(A*)) 
denote  the  closure  of  I (A*). 


Lemma  3.2.  Suppose  R is  a nonempty  proper  subset 
Condition  3.3.  A necessary  condition  for  (p*,A*) 
to  be  a local  minimum  of  q(p  ,A)  = P [(X  ,X  ) c 

1 9 A 12 

(and  hence  <|>(R,A*)  = 0).  Furthermore  in  this  case 
A c (-1,0]  when  A*  < 0,  ii)  <|>(R,A)  = 0 for  A e 


of  N,  x N which  satisfies 
1 2 

(A*  e (-1,1)  and  p*  e c!.(I(A*) 
R]  is  that  q(p^,A*)  = 0 
i)  <J>  ( R , A ) = 0 for 
[0,1)  when  A*  > 0,  and 


iii)  either  <j>(R,A)  = Q for  A e (-1,0]  or  A € [0,1)  when  A*  = 0. 
See  the  appendix  for  the  proof. 
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Lemma  3.3.  Suppose  R satisfies  the  conditions  of  Lemma  3.2.  Then  i)>(R,A), 
regarded  as  a function  of  A,  is  quasiconcave  on  (-1,0],  i.e.  if 
-1  < A1  < A2  < A^  <_  0 then  <J>(R,A2)  _>  min{<j>(R,A  ) ,<f>(R,A3) } . 

Proof.  Suppose  the  contrary,  then  there  exist  -1  < A,  < A„  < A„  < 0 for 

1 4 O "r 

which  $(R,A^)  > 4> ( R , A^ ) and  ^(RjA^)  > <j>(R,A2).  It  can  be  shown  that 

<f>(R,A)  is  continuous  in  A and  hence  there  exists  A*  € (A  ,A  ) satisfying 

X o 

<(>(R,A* ) = min  <j>(R,A).  Choose  p*  e c*  ( 1 ( A*) ) so  that  P ...  .[(X,  ,X0)  e R] 

AeCA^]  P 1 2 

<Ji(R,A*).  Two  cases  arise:  4>(R,A*)  = 0 and  4>  ( R , A * ) > 0.  If  <j>(R,A*)  = 

P jl  A*[(X  ,X  ) e R]  = 0 then  p*  = 0 or  1 + A*  since  R ^ 0.  It  follows 
p»,A5'  1 2 

from  Lemma  3.2  that  $(R,A)  = 0 for  all  A e (-1,0]  which  is  a contradiction. 
If  P ...  ^[(X^Xj)  e R]  > 0 then  choose  any  e-ball,  B,  about  (p*,A*>  so 
that  B n {(p1,A)|A  e (-1,1)  and  p^  e 1(A)}  c {(p^,A)  |A  e and 

p^  c 1(A)}.  By  Lemma  3.2  (p*,A*)  is  not  a local  minimum  and  hence  there 
exist  (p,A)  in  Bn  {(p^,A)|A  e (-1,1)  and  p e 7(A)}  so  that 
P— •^C(X1,X2)  e R]  < Ppft  ^[(X^X^)  £ i-mPii-es  that  <j>(R,A)  < <)>(R,A*) 

which  is  a contradiction  and  the  proof  is  complete. 

The  following  lemma  will  be  used  to  show  that  the  extension  of  Step  6 
gives  regions  which  satisfy  Condition  3.1. 

Lemma  3.4.  Fix  R c and  let  Rp  = { (x^jXj ) | (N^-x^ jN^- 3^ ) e R}  be 

the  rotation  of  R.  Then  4>(R,A)  = 4>(RP,-A)  for  all  A e (-1,1)  and 
4>(R,A)  = $(RP,-A)  for  all  A c (-1,1). 

c c 

Proof.  It  suffices  to  consider  <J> ( R , A ) since  $(R,A)  = 1 - <J>(R  ,A)  where  R 
is  the  complement  of  R in  x N2’  A e (-1,1);  it  is  easy  to  check 
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that  px  e cil(I(A))  <=>  l-pi  e cMI(-A)).  Then  it  follows  that 


N,  N„  , N -i 


Pd  AC(Xl’X9)  6 R]  = l (.1)( .2)p^(l-p  ) 1 (p  -A)j(l-p1+A) 
Pl’  1 1 (i , j )eR  1311  1 1 


N N 

r 1 \ / x 


N -l 
l 1 , 


l 0 (.  )(k‘  Kl-P ,)■ P,  (l-p.+A)K(p  -A) 
U,k)«RP  *’  K 11  1 1 


from  the  change  of  variables  l = N^-i  and  k = N^-j 


■ pi-Pl,-a[<W  e R L 


So  if  p"  e cZ(I(A))  satisfies  <J>  ( R , A ) = P A .[(X  ,X  ) € R]  then 
-L  X z 

<J>  ( R,  A* ) - P a .[(X  ,X  ) eRP]  > 4>  ( RP  , —A  ) . A similar  argument  gives  the 

X-p^j-ii  lx  — 

reverse  inequality  and  completes  the  proof. 

It  is  easy  to  check  that  ^(R^  = {(0,N2)},A)  = ) A | m on  (-1,0] 

where  m = max{N  ,N  };  hence  the  first  time  Step  1 is  entered  A will 
1 1 

be  -(l-a)m  and  the  algorithm  will  go  to  Step  4.  It  will  now  be  shown 
that  the  algorithm  is  executable  at  all  later  iterations  since  Step  4 can 
always  be  implemented. 

Lemma  3.5.  Suppose  {R  , ...,Rd  and  {d  , ...,d^_  } satisfy  Conditions  3.1 
and  3.3.  If  Step  4 is  entered  then  there  exists  a nonempty  set,  S, 
satisfying  (4.1)  and  (4.2)  of  Step  4. 

Proof.  Fix  S = - Lq.  We  claim  S i 0 under  the  hypotheses  of  the  lemma. 

Step  4 is  entered  only  if  = (A  e (d^  ^,0)  |<|>(R^,A)  < l-a)  is  nonempty 
and  d.  = inf  D..  < 0.  Hence  there  exists  A e (d.,0)  such  that  <f>(R.,A)  < l-a 
while  ^(R^jd^^)  l-a  by  construction.  The  level  set 
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{A  c (-1,0)|  (}>  (R^  ,A ) >_  1-a}  is  convex  since  <f>  is  quasiconcave  and  so 

<t>  ( R . , 0 ) < 1-a . On  the  other  hand  if  S = L . - L„  = 0 then  L . = L„  and 

i i o l 0 

<J)  ( » 0)  = 4>  ( x - (Lq  u IL),0)  >_  cf>  (N^  x N2  - (LQ  u UQ),0)  >_  1-a  where 
the  first  inequality  follows  from  IT  c U and  the  second  from  Step  0. 

This  contradiction  shows  S i-  0.  It  is  proved  in  the  appendix  that 

S u R.  : K1  x - (Lq  u (K)  satisfies  (4.1).  The  above  inequalities  show 

that  <p(S  u R.,,0)  >_  1-a  while  <f>(S  u R^,d^)  >_<|>(R^,d^)  >_  1-a  . Hence 

(A  e (d^,l)|4>(S  u R..,A)  < 1-a}  e (0,1)  by  the  convexity  of 

{A  e (-1,0]|4>(S  u R^,A)  >_  1-a}.  This  implies  that  (S  u R^)  satisfies 

(4.2)  and  completes  the  proof . 

Theorem  3.1.  The  (P,S)  pair  constructed  by  the  algorithm  satisfies 
Condition  3.1. 

Proof.  The  inequality  <p(R^,A)  >_  1-a  for  A e [d^  ^,d^]  holds  by  con- 
struction for  any  i with  d^  <_  0.  Lemma  3.5  shows  that  the  inductive 
steps  1 through  5 can  stop  only  with  d^  >_  0 and  the  execution  of  Step  6. 
Lemma  3.4  shows  that  the  above  inequality  holds  for  i with  d^  >_  0 and 
the  result  is  proved. 

The  next  two  results  demonstrate  Condition  3.2  that  T^^x^)  is  always 
an  interval.  They  are  based  on  the  following  easily  derivable  representation 
of  T(x1,x2). 


T(x1,x2)  = T1(x1,x2)  u (-T1(N1-x1,N2-x2)) 


where  T (x  ,x  ) = (-1,0]  n u (d  ,d.  ],  I(x  ,x  ) = (j|(x  ,x  ) e R , 

1 1 * jel(x1,x2)  ] :+1  12  12  ]+1 


jeKx^,x2) 


d..  < 0}  and  -T^x^x^  = [0,1)  n 


C-d^+1,-d_.).  The  set  T^x^x^ 


is  that  part  of  Kx^x^)  on  and  to  the  left  of  the  origin;  the  set 
(-T,  (N^-x  ,N2-X2  > ) '*'s  that  Part  of  T(x  ,x  ) on  and  to  the  right  of  the 
origin. 


Lemma  3.7.  For  every  (x^,x2>  e , T]/xq,X2^  as  an  interval. 

Proof.  If  T^(x^,x2)  = 0 the  result  is  trivial;  assume  T.(x^,x  ) / 


and 
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Suppose  T^Cx^x^)  is  not  an  interval  so  that  there  exists  A^,  A^ 

A_  satisfying  -1  < A < A < A <_  0 for  which  A ,A  e T (x  ,x  ) and 

O X Z O X o X X Z 

A i T (x  ,x  ).  Let  i < i < i be  the  indices  for  which  A e (d.  ,d.  ]> 

Z X X Z 1 4 w X 1^*"X 

A.  € (d.  ,d.  ] and  A € (d.  ,d.  ].  The  point  (x,  ,x  ) e R.  = U'.  - (i.  = 

2 i2-l  i2  3 i3-l  i3  F 12^^ 

(x  ,x_)  e U.  c ti!  since  the  sequence  {U1.}  is  nondecreasing  by  construction. 

1 2 l i 3 


But  (x  ,x  ) i R.  and  (x  ,x  ) e R. 

12  12  12  13 


(x„,x„)  £ U.  - U.  which  is 
12  x2  x3 


impossible  since  the  {LL}  sequence  is  also  nondecreasing  and  completes  the 
proof. 


Theorem  3.2.  For  every  (x^,x2)  e , T(x^,x2)  satisfies  Condition  3.2. 

Proof.  If  T^(x^ ,x2 ) = 0 or  T1(N1-x1,N2~x0)  = 0 then  the  result  is 
immediate  from  Lemma  3.7.  Now  suppose  T^(x3,x2)  / 0,  T^CN^-x^ ,N2~x2 ) / 0 
and  T(x^,x2>  is  not  connected.  Let  t be  the  index  for  which 

0 e (d^,dT+1].  It  follows  that  either  0 i T^(x^,x2)  and/or  0 i ’N2_X2 ^ 

<=>  either  (x^,x2)  e and/or  (N^-x^ >N2-X2 ) e ^x+l'  last  equivalence 

follows  from  the  fact  that  if  (x, ,x.)  e L , =>  (x, ,x„)  c L.  for  j < t 

1 2 t+1  12j  — 

since  {L^}  is  a nonincreasing  sequence  =>  T^Cx^jX^  = 0 which  is  impossible. 
Assume  wlog  that  (x^x^  e Ut+1  c Uq  = Lq  c L^+1  =*  ,N2~x2  ) e Lt+1  => 

Ti(^1_xi’N2_X2)  = ^ which  is  a contradiction.  A similar  contradiction  arises 
if  (N1-x^,N2-x2 ) e and  completes  the  proof. 


f 

25 

Theorem  3.3.  All  regions  in  the  family  S constructed  by  the  algorithm 

satisfy  Condition  3.3. 

Proof.  The  regions  , . . . ,R^+^(  0 e (d^d^^])  satisfy  Condition  3.3  by 
construction.  It  can  easily  be  checked  that  if  R satisfies  Condition  3.3 
then  R also  does.  The  result  follows  since  for  i > t,  r.  = gP  for 


4.  An  Example  and  Conclusions 


This  section  will  present  a detailed  example  to  illustrate  the 
unconditional  method  of  Section  3 and  make  some  comparisons  among  the 
Thomas-Gart  (TG)  method,  the  conditional  method  of  Section  2 and  the 
unconditional  method  of  Section  3. 

The  example  below  constructs  80%  confidence  intervals  when 
= n2  = ^ ‘ T'le  samPle  size  = 2 was  chosen  to  keep  the 

computations  feasible  by  hand  while  the  relatively  large  value  of 
a = .2  was  selected  to  illustrate  Step  5.  Step  5 is  not  used  for 
a < .125. 

Initialization : 

Step  0:  Set  dQ  = 1,  R^  = {(0,2)}.  Let  U = {(0,2)};  U satisfies 
0.1,  0.2  and  0.3  (4>((Ju  (/*, 0)  = .125  < .2).  U also  satisfies  0.4 

since  for  any  other  candidate  set  B either  = {(0,2), (0,1)}  c B 
or  U2  = {(0,2), (1,2)}  c B and  hence  $ (B  u BP)  _>  #(U1  u UP,0)  = 4>(U2  u liP,0) 
= .375  > .2.  So  UQ  = U = {(0,2)}  and  LQ  = {(2,0)}. 

Iteration  1: 

2 

Step  1:  <KR1,A)  = A or  0 according  as  A < 0 or  A > 0 so 
Dx  = {A  e (-1,1)|^(R1,A)  < .8}  = (-/T8,l)  and  AL  = inf  D1  = -/T? 

= -.8944.  Go  to  Step  2. 

Step  2:  R - {(0,2)}  = $ =>  sup  $(R  - {(0,2)}, A)  = 0 < . 8 ->  U = <p . 

A e(-l»- . 8944) 

Set  d^  = -.8944  and  go  to  Step  4. 

Step  3.  Candidate  sets  satisfying  (4.1)  and  |S|  =1  are  S^  = {(0,1)} 
and  S2  = {(1,2)}.  In  both  cases  <)>(S^  u R^,A)  = $(R^,A)  implies  that 
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inf{A  e (-.  8944,1)  | <J>( u R^,A)  < .8}  = -.8944  violating  4.2. 

Candidate  sets  satisfying  (4.1)  and  |s|  = 2 are  S.^  = {(0,0), (0,1)}, 

52  = {(1,2), (2, 2)}  and  S = {(0,1), (1,2)}.  For  i=l  and  2 

♦ (Si  U RX,A)  = <fr(R^,A)  =>  inf  {A  e (-.  8944,1)  | <>(  S±  U R^A)'^  .8}  = T.8944 
again  violating  4.2.  However 

/* 

(1-A)3(5+3A)/16,  0 < A < -.4415 

<J>(S3  U R1,A)  = < -2A  - A2,  -.4415  _<  A < 0 

0,  0 _<  A < 1 

v 

=>  inf{A  e (—.8944,1)  |<J>(S^  u R^,A)  < .8}  = -.5754  > d^.  By  construction 

53  satisfies  4.3  and  hence  S = S^.  Set  R2  = {(0,2), (0,1), (1,2)}; 

U = <p  since  4>(R2  - {(0,2)}, -.8944)  = .1795.  Finally  set  R2  = R? 
and  go  to  Step  1. 

Iterations  2,  3,  4 and  part  of  5 are  summarized  in  Table  2.  The  values 
of  A^  listed  in  column  3 are  calculated  in  Step  1.  In  all  4 cases  the 
maximum  of  4^  - Uq,A)  over  A in  Cdi  ,AX]  is  less  than  .8  implying 
that  U = 4 and  the  algorithm  goes  from  Step  2 to  Step  4.  Column  7 shows 
that  U = <p  in  iterations  2,  3,  and  4 while  U = {(0,2)}  in  iteration  5. 

We  now  complete  iteration  5. 

Iteration  5 (cont.): 

5:  Set  U*  = {(0,2)};  Rg  - U*  = {(0,0), (0,1), (1,0), (1,1), (1,2), 
(2,1), (2, 2)}  satisfies  (5.1);  (5.2)  holds  since 
inf  {A  e (-.0757,1)  |4>(R6  - U*,A)  < .8}  = .3137  > -.0757  while 
(5.3)  is  trivially  satisfied.  Set  Rg  = R - {(0,2)}  and  go  to  Step  1. 


ITERATIONS  2-5  OF  ALGORITHM 


Step  5 must  be  executed  during  Iteration 


Iteration  6: 


Step  1:  Dx  = (.3137,1)  so  that  ^ = .3137;  U0  ' U6  = so  set 
dg  = .3137  and  go  to  Step  6. 

Step  6: 


(-1.-.8944] 

(-.8944, -.5754] 

R2  = {(0,2), (0,1), (1,2)} 

(-.5754, -.5528] 

R3  = {(0,2), (0,1), (1,2), (1,1)} 

(-.5528, -.1649] 

R4  = {(0,2), (0,1), (1,2), (1,1), (0,0), (2, 2)} 

(-.1649,-. 0757] 

R5  = {(0,2), (0,1), (1,2), (1,1), (0,0), (2, 2), (1,0)} 

(-.0757,  .0757] 

Rg  = Rg  = {(0,1), (1,2), (1,1), (0,0), (2, 2), (1,0), (2,1)} 

[.0757, .1649) 

R?  = Rg  = {(2,0), (2,1), (1,0), (1,1), (2, 2), (0,0), (1,2)} 

[.1649, .5528) 

R8  = Rj  = {(2,0), (2,1), (1,0), (1,1), (2, 2), (0,0)} 

[.5528, .5754) 

Rg  = R3  = {(2,0), (2,1), (1,0), (1,1)} 

[.5754, .8944) 

R10  = Rj  = {(2,0), (2,1), (1,0)} 

[.8944,1) 

R11  = = 

The  80%  confidence 

intervals  for  A = - p2  are: 

x x-  T(x. ,x0) 

(-1.-.0757] 
(-.8944, .0757) 
(-.8944, .1649) 
(-.5754, .5754) 
(-.5528, .5528) 
(-.5528, .5528) 
(-.0757, .8944) 
(-.1649, .8944) 
[. 0757,1) 
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Remark  4.1.  As  iteration  4 illustrates,  there  can  be  several  sets  S 

satisfying  (4.1)-(4.3)  and  several  sets  U*  satisfying  (3.1)-(3.3)  or 

(5.1)- (5. 3).  Randomization  or  an  arbitrary  selection  rule  can  be  used 

to  break  such  ties.  For  example  the  following  rule  is  used  here:  "choose 

1 v 

the  set  S minimizing  -m-  l i;  randomize  among  sets  tied  accord- 

II  (i, j )eS 

ing  to  this  criteria". 

Remark  4.2.  So  far  no  mention  has  been  made  of  the  computational  work 

required  to  implement  the  algorithm.  In  the  example  when  R2  = {(0,1), (0,2), (1,2)} 

and  A < 0: 


<KR 0,M  = min  T(X.,X  ) e R ] 

2 Pie[0,l+A]  pl’  1 2 2 


min  {-3p^  + 6(A+l)p^  - (3A2  + 10A  + 5)p2  + 2(2A2  + 3A  + l)p  - (A2+2A)}. 
Ple[0,l+A]  11  1 1 


Minimizing  P .[(X  ,X  ) e R ] in  p for  fixed  A requires  comparison 

It  2 1 

of  the  function  values  at  the  bounding  points  0 and  1+A  and  at  the  zeroes 


of  the 


equation  P'  [(X  ,X  ) e R ] = 0 where  the  prime  denotes  partial 
12  2 


differentiation  with  respect  to  p^.  In  this  case  the  zeroes  of  a 3—  degree 
polynomial  must  be  computed.  In  the  general  case  the  zeroes  of  an  (N^+N2~l) 
degree  polynomial  must  be  computed.  This  particular  case  can  be  simplified 
by  reparameterizing  the  problem  to  w = (l+A)/2  - p . 

For  A < 0: 


(J>(R2,A)  = min  P 1+A  CX  ,X  ) e R ] 

|»|<±£  4 

= min  { ( 1-A )4/16  + (l-A2)(l-A)2/4  ♦ (1.5A2  - A - ,5)w2  - 3w4}, 

I I 1+A 

M<.— 


m 
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2 2 

The  terms  in  brackets  are  a function  of  t = w , say,  g(w  ).  So 


<KR0,A)  = min  {g(0)  + (1.5A2  - A - .5)t  - 3t2} 

0<t<li±^ 


min  g(t). 

0-^ 


Clearly  g(t)  is  concave;  its  minimum  is  either  achieved  at  0 or 
(l+A)2/4.  For  A <_  - . 4415,  *(R  ,A)  = g(0)  = (1-A)4/16  + (l-A)2(l-A2)/4; 
for  -.4415  < A <_  0,  <(i(R2,A)  = g((l+A)2/4)  = -A(2+A). 

In  general,  the  order  of  the  polynomial  in  p , P A[(X  ,X  ) e R ], 

J.  JL  Z 

can  be  halved  by  the  same  reparameterization  whenever  = N2  and 

= ^N1~X2’  N2_Xl)  Uxi»x2^  e These  conditions  imply  that  ^(X^X^fR^ 

is  symmetric  in  p about  (l+A)/2. 

When  the  algorithm  is  applied  to  the  = N2  = 2 case  for  a = .01, 

.05  and  .1  it  yields  the  A intervals  of  Table  3.  The  corresponding 
conditional  A intervals  of  Section  2 are  listed  in  Table  4 and  the 
Thomas-Gart  A intervals  based  on  the  Baptista-Pike  tp  intervals  are  listed 
in  Table  5.  Note  that  the  99%  Thomas-Gart  A intervals  of  Example  1.2 
are  based  on  the  Thomas  (1971)  <p  intervals.  Hence  the  intervals  of  Table  5 
are  never  wider  than  those  listed  in  Example  1.2.  We  shall  make  several 
comparisons  among  these  intervals. 

We  begin  by  continuing  Example  1.2.  The  actual  coverage  probabilities 
of  A = 1/2  are  listed  below  when  p1  = 3/4  and  p2  = 1/4.  The  intervals 


are  taken  from  Tables  3,4  and  5;  all  have  nominal  99%  confidence  coefficients. 
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3.  UNCONDITIONAL  CONFIDENCE  INTERVALS  FOR  N = N?  = 2 


X1 

X2 

90% 

95% 

99% 

0 

2 

-1 

0 

-1 

.0543 

-1 

.3676 

0 

1 

-.9487 

.2747 

-.9747 

.4294 

-.9950 

.6708 

1 

2 

-.9487 

.3591 

-.9747 

.5028 

-.9950 

.7183 

1 

1 

-.7147 

.7147 

-.8048 

.8048 

-.9160 

.9160 

0 

0 

-.6838 

.6838 

-.7764 

.7764 

-.9000 

.9000 

2 

2 

-.6838 

.6838 

-.7764 

.7764 

-.9000 

.9000 

2 

1 

-.2747 

.9487 

-.4294 

.9749 

-.6708 

.9950 

1 

0 

-.3591 

.9487 

-.5028 

.9747 

-.7183 

.9950 

2 

0 

0 

1 

-.0543 

1 

-.3676 

1 

4.  CONDITIONAL  CONFIDENCE  INTERVALS  (SECTION  2)  FOR  Nx  = N2 


X1 

X2 

90% 

95% 

99% 

0 

2 

-1 

.1178 

-1 

.2515 

-1 

.4811 

0 

1 

-1 

.5 

-1 

.6268 

-1 

.8174 

1 

2 

-1 

.5 

-1 

.6268 

-1 

.8174 

1 

1 

-.7151 

.7151 

-.7945 

.7945 

-.9043 

.9043 

0 

0 

-1 

1 

-1 

1 

-1 

1 

2 

2 

-1 

1 

-1 

1 

-1 

1 

2 

1 

-.5 

1 

-.6268 

1 

-.8174 

1 

1 

0 

-.5 

1 

-.6268 

1 

-.8174 

1 

2 

0 

-.1178 

1 

-.2515 

1 

-.4811 

1 

-1 


1178 


-1 


2515 


-1 


-1 

.3486 

-1 

.4150 

-1 

-1 

.3486 

-1 

.4150 

-1 

7151 

.7151 

-.7945 

.7945 

-.9043 

-1 

1 

-1 

1 

-1 

-1 

1 

-1 

1 

-1 

3486 

1 

-.4150 

1 

-.4808 

3486 

1 

-.4150 

1 

-.4808 

1178 

1 

-.2515 

1 

-.4811 
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.996,  based  on  Table  3 

P3/4  1/2  CAL(^)  < 1/2  < AU(^);l  = { -996’  baSed  °n  Table  4 

.949,  based  on  Table  5. 

V. 

Our  intervals  gain  extra  coverage  probability  as  a decreases 

since  they  become  much  wider  than  the  TG  intervals  for  m = 1 and  3 . 

In  general  our  conditional  intervals  and  the  TG  intervals  are  both  (-1,1) 

when  m = 0 or  N^  + N^;  they  coincide  in  a non- trivial  interval  when 

<l> T < 1 < <|4,  and  m = N = N . For  other  choices  of  m when  N,  > N 

and  > 1 the  unconditional  intervals  become  much  wider  than  the  TG 

intervals  as  a decreases;  this  characteristic  is  the  source  of  the 

counterexample  given  in  Section  1.  For  fixed  (x  ,x  ),  A. (x  ,x  ) should 

12  L 1 2 

intuitively  approach  -1  and  A^Cx^.x^)  should  approach  1 as  a decreases 

to  zero.  It  will  be  shown  below  that  for  a given  m,  A and  A generated 

U L 

by  the  TG  method  are  constrained  to  a proper  subset  of  (-1,1)  unless 
m = and  hence  cannot  attain  +1  and  -1  respectively  as  a 

decreases.  It  follows  that  counterexamples  similar  to  Example  1.2  are 
possible  even  for  large  N^,  and  N2  by  examining  A near  +1  and  -1, 
for  these  values  will  be  excluded  from  certain  A intervals  regardless 
of  the  a chosen. 

Our  conditional  intervals  are  generally  wider  than  our  unconditional 
intervals  although  this  is  not  uniformly  the  case  as  the  outcome  (X^X^)  = (1,1) 
shows  when  a < .1.  This  phenomenon  can  be  explained  by  looking  back  at 
the  example.  In  iteration  2,  S = {(1,1)}  is  chosen.  If  instead 
S = {(0,0), (2, 2)}  had  been  used  (in  violation  of  (4.3))  the  following 
changes  would  result  in  the  unconditional  intervals: 
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xL  x2  95%  99% 


1 

1 

-.7623 

.7623 

-.8946 

.8946 

0 

0 

-.8048 

.8048 

-.9160 

.9160 

2 

2 

-.8048 

.8048 

-.9160 

.9160 

These  unconditional  intervals  are  uniformly  more  narrow  than  the  conditional 
intervals  of  Table  4.  However  the  revised  set  of  unconditional  intervals 
is  not  uniformly  more  narrow  than  the  original  unconditional  intervals 
of  Table  3 (nor  is  the  reverse  true).  Furthermore  the  use  of  the  revised 
unconditional  intervals  over  those  of  Table  3 results  in  an  increased  total 
(average)  length  of  the  intervals  from  14.9254  (1.6584)  to  14.9522  (1.6614) 
when  a = .01  and  from  12.5870  (1.3986)  to  12.6156  (1,4017)  when  a = .05. 
This  computation  illustrates  the  operation  of  the  "greedy"  heuristic  in  the 
form  of  (4.3). 

We  conclude  this  section  by  showing  that  A and  A are  bounded 

U L 

away  from  +1  and  -1  respectively  except  when  m = = N2.  Assume 

wlog  that  N1  _>  max{N2>2}.  Fix  (x^.x^  satisfying  0 < m = x^  + x2  < + N2 

(the  other  two  cases  are  trivial).  Given  0 < i|>  < <|i  < °°  then  the 

L — U 

x ,x  calculated  from  (1.5)  satisfy: 

Li  u 

(4.1)  max{0,m-N„}  < x ,x  < min{m,N}. 

Z Li  U 


Then  A = A^  and  A^  are  calculated  from  x = x^  and  respectively 

by 


A 


x (m-x) 


(4.2) 
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Substituting  the  bounds  (4,1)  into  the  equation  (4.2)  gives  the  following 
bounds  on  Ar  and  A,,: 


m-N  -N  N +N  -m 

max{iy  < VAU  < min{  }* 


When  m = these  bounds  are  -1  and  +1;  when  m i the  lower 

limit  is  greater  than  -1  and  when  m i N the  upper  limit  is  less 
than  +1.  Hence  if  for  some  A < 1 there  is  a nonempty  set  A c x for 

which  £ e A =>  A ^ (A  (jtJ.A^jt))  for  all  a then  for  any  e > 0, 
p^  can  be  chosen  to  satisfy 

P aCAt(X)  < A < A (X)]  < inf  P ,[X  | A]  + £ 

P1’A  U%  -«I(A)  *’A 

= 1 - 4>(A,A)  + e. 


It  follows  that  for  any  a < $(A,A),  (AL(;<),  A (JC))  cannot  satisfy  (1.7). 
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5.  Summary 

This  paper  has  adopted  a frequentist  approach  to  the  problem  of 
determining  exact  confidence  intervals  for  A = p^  - p^  in  2x2 
contingency  tables.  Since  this  is  a nuisance  parameter  problem  the 
intervals  proposed  achieve  converage  probabilities  greater  than  or 
equal  to  their  nominal  (1-a)  levels.  The  conditional  intervals  of 
Section  2 are  easily  computed  from  conditional  ip  intervals.  The 
unconditional  intervals  of  Section  3 are  much  more  difficult  to  compute 
but  generally  yield  narrower  intervals  than  the  conditional  ones.  The 
exact  method  of  Thomas  and  Gart  (1977)  should  be  considered  an  asymptotic 
method  appropriate  for  reasonably  large  a.  Conditional  intervals  for 
p are  also  presented. 
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Appendix 

Two  preliminary  lemmas  are  presented  below  which  are  required  in 
the  proof  of  Lemma  3.2. 


Lemma  A . 1 . Suppose  R^  satisfies  Condition  3.3;  denote 
ni^RJX2^  = {*1|(x1,*2)  e Rj  and  ll^Rjx^  = {x^U^x^  e Rj. 
Whenever  these  sets  are  nonempty  they  are  intervals  of  integers . 
The  proof  follows  from  an  easy  contradiction  argument. 


Lemma  A . 2 . Suppose  R = U’  - U where  U c U’  f N x N2  and  both  are 

NWC  sets.  For  p (A)  = A then  either  G(A)  = P.  ,[(X, ,X„)  e R]  = 0 
1 A, A 1 2 

for  all  A e (0,1)  or  there  exist  integers  s,£  and  m > 0 satisfying 

l 

0 < s < l < m for  which  G ( A ) = £ (1')A',(1-A)rn  for  all  A e (0,1). 

A=s  ^ 

Furthermore  analagous  representations  hold  when  p^(A)  = 1 and  A > 0, 
or  p^  = 0 and  A < 0 or  p (A)  = 1+A  and  A < 0. 

Proof . By  Lemma  A.l  IT^ ( R | ei-ther  empty  or  has  the  form 

(s ,s+l, . . . , £}  for  some  0 <_  s _<  il  < N2>  When  p^  = A then 

p = p -A  = A-A  = 0 and  hence  G(A)=0  V A e (0,1)  when  II  (R|N„)  = <f> 

1 1 i 1 1 

and  G(A)  = \ (t!,)A](l-A)m":]  Vie  (0,1)  when  n (R|N  ) t <f>.  This 

j=s  : 

completes  the  proof. 

Proof  of  Lemma  3.2.  First  we  consider  the  case  when  (p*,A*)  is  on  the 

boundary  of  6 = { (px , A ) | A e (-1,1)  and  p1  e 1(A)}.  Suppose  A*  > 0 

and  p*  = A*;  a slight  modification  of  the  argument  below  works  for  any 

A*  t 0 and  p*  e c U.  I(  A*) ) - I(  A*) . From  Lemma  A. 2,  G(  A)  = P.  „[(X,,X „)  e R] 

1 «...  1 2 

is  either  0 V A e (0,1)  or  has  the  form  \ (™)A^ (1-A)m  ■*  for  some 

j=s  3 

m and  0 _<  s <_  l <_  m.  Hence  if  G(A*)  = 0 then  G(A)  = 0 for  A e (0,1). 

We  claim  G(A*)  >0  is  impossible.  If  G(A*)  > 0 then  the  second 


X-J. 
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representation  holds  with  0 <_  s <_  £ <_  m.  Note  that  G'(A*)  = 0 and 

G’’(A*)  0 since  A*  is  a local  minimum.  If  s = 0 and  i = m then 

G(A*)  = 1 which  is  impossible.  If  s = 0 and  l < m then 
A , , 

G ( A ) = 1 - / b(u,JL+l,m)du  where  b(u,k,n)  = — r-yy  u-i(l-u)n 

q ^ \ K—J.  ) • v n—  K ) , 

while  if  0 < s and  l = m then  G(A)  = / b(u,s,m)du.  In  either 

0 

case  G’(A*)  ^ 0 and  hence  is  impossible.  If  0 < s ^ Jl  < m then 
1 

G(A)  = / (b(u,s,m)  - b(u, £+l,m)}  du,  G'(A)  = b(A,s,m)  - b(A,Jt+l,m)  and 

after  some  algebraic  manipulation,  G''(A)  = G 1 ( {£+l-s} 

A(l-A)  A (1-A) 

< - ^ A ^ * In  particular  G ' ' ( A* ) < 0 since  G'(A*)  = 0 

which  is  again  impossible.  Now  suppose  A*  = 0 and  p*  = 0,  then  p*  = 0. 

Hence  G(0)  = <p  (R,0)  = 0 or  1 according  as  (0,0)  ^ R or  (0,0)  e R. 

The  latter  case  is  impossible  since  (p*,A*)  is  a local  minimum  and 

R / N1  x N2>  To  show  that  either  ( R,A ) = 0 for  all  A € (-1,0)  or  for  all 

A e (0,1),  suppose  not.  Then  there  are  -1  < A^  < 0 < A ^ < 1 so  that 

4>(R,A.)  > 0 for  i = 1 and  2.  This  implies  P t(X  ,X  ) e R]  > 0 and 
•1  0 ^ 12 

P a » C (x  ,X  ) e R]  >0.  Hence  there  exist  positive  integers  x*  and  x* 

2>U2  ^ ^ Jl  ^ 

so  that  (0,x*)  e R = U'  - U and  (x*,0)  e R.  Now  (x*,0)  e R and 

(0,0)  {R  =*(0,0)  e U =>(  0,x2)  e U for  all  x2  e {0, . . . ,N  } =>  (0,x*)  R 

a contradiction.  A slight  modification  of  the  above  argument  yields  the 

result  when  A*  = 0 and  p*  = 1. 

We  now  show  that  a local  minimum  cannot  occur  at  (p*,A*)  e 6. 

Suppose  ( p* , A* ) eg  is  a local  minimum  then  P . ..[(X,,X„)  e R]  > 0 
1 P",o5>  1 2 

since  p*  is  not  on  boundary  of  7 ( A* ) . Choose  e > 0 so  that  the 
open  ball,  B,  of  radius  e satisfies  (1)  B c g and 

(2)  Ppft  A*^X1’X2^  € — Pp  A^Xl’X2^  € whenever  (p^A)  e B. 

Define  p*  = P*  “ and  for  i e JI^(R)  let  s^  = minn^tRji)  and 
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: 


JL  = maxII2(R|i);  also  let  H(p^,p^)  be  defined  on  [0,1]  x [0,1]  by 


"(fl’p2>  ' Pp1,p1-p2C(>il'X2>  e R] 

N . N -i  1 

= l ( )PJ(1-Pl)  J b(u,l.+l,H  )du  + 

i'C  p2 

N . N -i  P2 

I ( .i)pJ(l-p1)  1 I [b(u,s  N ) - b(u,t.+l,N_)]du  + 

: ^\a  ± ± ± - 1Z  1Z 


N M -i  P2 

I ( .•  Jp/l-P,)  1 I b(u,s.,N  )du  + 
ieT  1 1 0 1 i 


N.  . N-l 

l < i1)Pl(1-Pl)  1 

ieA 

where  C = {i  e 5L(R)|o  = s.  < < N>,  M = {i  e H,(R)|o  < s.  < i.  < N.) , 

1 ' l—i  2 1 1 l—  i 2 

T : (i  « Il^fRjIo  < s£  .1  ^ = ^2}  and  A = (i  e IT^(R)|o.=  and  = N2). 

The  case  B = M = T = <f>  is  impossible  since  H(p  ,p  ) must  then  be 

N N -i  l N . N -i 

independent  of  p2  of  the  form  H(p1,p2>  = £ ( ^(l-p^  = \ ( Tjp^l-p^^)  1 

ieA  i=s  1 

for  some  0 s <_  8.  <_  N by  Condition  3.3.  In  particular  p*  must  be  a 

l Ni  i N -i  1 

local  minimum  for  G(p1>  = £ (,  )p^(l-p^)  . Arguments  similar  to  those 

i=s 

above,  show  this  is  impossible. 

Hence  at  least  one  of  the  set  R = {(i,s.-l)|i  e n.(R)  with  s.  > 0} 

w 1 1 1 

and  R^  = {(i,<L)|i  e JI^(R)  with  £.  < N2)  must  be  nonempty.  Now  since 
(p*,A*)  is  a local  minimum  it  follows  that  VH  = VH(p*,p*)  = 0 and 

**■  Z 

V*H(  p*,p*)  is  positive  semi-definite,  i.e.,  z'VHz  >_  0 for  all  y e R2. 

Let  z = (A,-l)  then 


32H  , 82H 

3p2  3P13P2 


2 2 
, 2 d H 1 JH 

3p2  " X 3p23pl 


z'VHz  = 
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Let  = a + 6i  (6  > 0)  be  the  line  specified  by  Condition  3.3  which 
passes  through  R and  let  \ = gp^l-p^/p^l-p^ . Then 


(A.l)  z'VHz  = 


92H  6Pl(1~Pl>  32H  a _3H_ 

3p2  " p2a~P2)  3pl3p2  " Pl^-Pl^  3p2 


bp2(i-p2) 

2 

2 

3 H 

1 P2(1_P2)  32H 

a 

3H 

pi(1_pi^ 

6 Pl(1-Pl)  3p23pi  + 

6pia"Pl)  3p2j 

derivatives 

are  evaluated  at  (p*,p*)  in 

(A.l). 

Both 

bracketed  terms  will  be  shown  below  to  be  negative  thus  leading  to  the 
desired  contradiction. 

After  two  differentiations  and  a rearrangement  of  terms,  the  first 
bracketed  expression  in  (A.l)  can  be  shown  to  be 


(A. 2)  N l 

(i,j)eRc 


si-(a+6i) 

P*U-p*) 


N,-i  N -1  s.-l 


1 • -X  O.-J. 

( .^(p^a-p*)  (g^Kp*)  1 (i-P*) 


N2-Si 


- N„ 


l 

(i,A)cR 


^i+l-(a+6i) 

. p*(1-p2)*  J 


N1  i Nl_i  V1  *i  V 

( /xpj^a-p*)  ( 2.  xp*)  ^i-p*)  2 


i-niP*+(n2-i)p* 

P^(l-P2)* 


3H 

3Po 


By  assumption 


3H 

3Po 


D,  s.  - (a+0i)  <_  0 for  (i,j)  e R and  i.  - (o+6i)  > 0 

■L  b 1 

for  (i,j)  e RL.  Furthermore  PpA  ^[(X^X^l)  c R^]  can  be  shown  to  be 
3H  ^ 

positive  from  = 0 and  the  fact  that  Rg  u R^  must  be  nonempty.  Dropping 

the  first  term,  rewriting  the  second  as  a probability  and  setting  the  last 


equal  to  zero  gives  the  following  upper  bound  on  (A. 2): 
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-N„ 


= P * *C(X,,X  -1)  e R ] < 

Pid-P?)  pip*  1 2 L 


0. 


A similar  argument  shows 

32H  P2(1~P2)  92H  a 3H 

ap2  - 6Pia-Pl)  9p2aPl  + BPld-Pl)  3Pl  " 

and  the  proof  is  completed. 

The  following  two  lemmas  establish  that  u S satisfies  Condition  3.3 
for  S = L.  - L„. 

l 0 

Lemma  A. 3.  Suppose  R.  = U|  - U.  satisfies  Condition  3.3  and  d>(R.,d.  ,)  > 1-a 
*^iii  y i’  i-1  — 

where  a £ (0,1)  and  d^_^  <_  0;  if  £ L^  then  (x  -l,x2)  ^ U ^ 

and  (x1,x2+l)  U ^ whenever  these  latter  two  points  are  in  x N2. 

Proof.  Suppose  R^  satisfies  the  above  conditions  and  there  exists 
(x^,x2)  e which  (x^-l,x2)  e U ^ (the  case  (x^,x0+l)  is  proved 
analogously).  By  assumption  IT2 ( R^ ) = (j  integer |s  <_  j <_  £}  for  some 
0 < s < £ < N2>  For  any  k = we  have  (k,x2)  e IL  "*  (k,x2)  $ R^ 

=>  either  x?  < s or  x2  > l.  If  x2  < s =>  ( 0 , fi. ) { Ri  for  l <_  x2; 

(x1~l,x2)  e (h  =>  (0,x2)  e (K  =>  (0,£)  ^ R^  for  l > x2  and  we  conclude 
that  the  entire  line  {(0,£)|fc  = 0,...,N2>  is  not  in  R^ . But  this  implies 


0 < l-o  < <KR.,d  ) < P C(X  ,X  ) £ R.]  = 0 

— 1 1-1  — 0,d.  , 1 2 l 

1-1 


a contradiction.  If  x2  > £ then  a similar  contradiction  results  and  the  proof 
if  completed. 

Lemma  A. 4.  If  R.  satisfies  the  conditions  of  Lemma  A. 3 then  R.  u S 

l i 


satisfies  Condition  3.3  for  S = L^-LQ. 
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Proof . To  show  part  (1)  of  Condition  3.3  it  suffices  to  prove  that  x N2  ” 

is  a NWC  set  since  R.  u (L.  - L.)  = (N,  *N  - L.)  - U.  and  U.  c U c N * N - L. . 

l lO  120  l i0120 

Pick  y N2'  and  an¥  integers  *-(1^^  N^-x^)  and  kd^k^x^); 

it  must  be  shown  that  (x^jX^+il)  and  (x^-kjX^)  e N^xN2-LQ.  We  ^ave 

(x^Xj)  e x N2  - LQ  <-=^>  (x1>x2>  4 LQ  = <■=*=>  (N^-x^jN^x^  4 ^0 

=>  (N^+k,  N2-x2)  4 UQ  and  (H^,  N^x^l)  4 UQ  <=>  (x^k.lj)  4 tfQ  = LQ 

and  (xl5x2+Jl)  4 <=>(x1~k,x2)  and  (x^x^il)  e NixN2_L0‘  We  next  show 

that  JI-^R^  u s)  must  be  an  interval  of  integers.  Let  x^  = minll^R^  u S) 

and  x^  = maxII^CR^  u S).  Suppose  there  exists  an  integer  x^,  x^  < x^  < x^, 

■for  which  x^  4 H^(R^  u S).  It  follows  that  each  (x^,£)  must  be  in  LQ 

or  li.  for  i = 0,...,N2.  We  claim  that  (x^N^  4 L0  and  (x^jO)  4 U. . 

If  (xl5N2)  e LQ  then  (x^N^  4 H1  * N2  - LQ  =>  (x^i.)  4 ^ * N2  - LQ  c R u S 

for  every  l contradicting  the  assumption  that  x^  6 II^CR^  u ^).  Similarly 

if  (x^.0)  e ii.  then  (i,j)  e for  all  integers  0 <_  i <_  x^  and 

0 < j < N2  contradicting  the  assumption  Xj  £ H^R^  u S^*  Hence  it  must  be  that 

(x^jO)  e Lp  and  (x^,N2>  e U^.  It  follows  that  x2  = minIl^(U^|x^)  is 

(strictly)  positive  and  (x^jX^-l)  e LQ  c L^.  But  this  contradicts  Lemma  A. 3 

and  (x^,x2)  e IM  A similar  argument  shows  that  B2(R^  u S)  is  an  interval 

of  integers  and  concludes  the  proof  that  part  (2)  of  Condition  3 holds.  We 

begin  the  proof  that  part  (3)  of  Condition  3.3  holds  by  choosing  a e R1  and 

6 > 0 so  that  part  (3)  holds  for  R. . Let  s,  ,1,  >S.  and  L.  be  defined  for 

i k k j ] 

R.  as  in  part (3)  and  s* , 1*  , S*  and  L*  be  defined  for  R.  u S in  a similar 

i k k ] 2 l 

fashion.  Pick  m e II,  (R.  u S)  having  0 < s*.  If  m e n,(R.)  then 

1 l m 1 l 

s*  < s since  R.  c S u R.  =>  s >0  and  so  s*  < s < a+gm.  Now  suppose 
m — m l im  m — m — 

m 4 MR. ) then  (m,s*)  e L.  - L„.  Two  subcases  must  be  considered: 
li  m i o 

(1)  s*  € MR. ) and  (2)  s*  4 H_(R.).  In  the  first  subcase  we 
m z i m 2 l 


< m or  m < S . since  (m,s*)  4 R. . The  case  m < S 


s* 

m 


s* 

m 


is  impossible  since 


(S  *,s*)  e U[  =>  (m,s*)  e U'  contradicting  the  assumption  (m.s*)  e L.. 
s„  m i m mi 

m 

So  < m £ N2  an<^  si-nce  R-  satisfies  Condition  3.3  it  follows  that 

m s*  -a 

L*.{  >_  m > L _>  — - — or  at  6m  > s*.  In  subcase  2 it  must  be  that 
s s p m 

m m 

{(0,s*) , . . . ,(m,s*)}  c L.  - L otherwise  there  exists  an  x*  satisfying 
m m i 0 l J ° 

(x*  s*)  e U.  and  (x*+l,s*)  e L.  a contradiction.  This  shows  that 

1 m l 1 m l 

R.  c {(y,»yo)|0  < y < m and  s*  < y„  < N0}.  So  for  any  y,  e II,  (R.) 
i ± z — i m z — z 1 1 i 

we  have  y,  < m and  s > s*  > 0 and  hence  s*  < s < a+Sy  < a+Bm  => 
1 y1  m - m y^  — 1 

The  remaining  three  cases  follow  from  analogous  arguments  and  complete 
the  proof. 
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